Linear Combination of SOMs for Data Imputation: Application to Financial Problems Linear Combination of SOMs for Data Imputation: Application to Financial Problems

نویسندگان

  • Antti Sorjamaa
  • Francesco Corona
  • Yoan Miche
  • Paul Merlin
  • Bertrand Maillet
  • Eric Séverin
  • Amaury Lendasse
چکیده

This paper presents a new methodology for missing value imputation in a database. The methodology combines the outputs of several Self-Organizing Maps in order to obtain an accurate filling for the missing values. The maps are combined using MultiResponse Sparse Regression and the Hannan-Quinn Information Criterion. The new combination methodology removes the need for any lengthy cross-validation procedure, thus speeding up the computation significantly. Furthermore, the accuracy of the filling is improved, as demonstrated in the experiments.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sparse Linear Combination of SOMs for Data Imputation: Application to Financial Database

This paper presents a new methodology for missing value imputation in a database. The methodology combines the outputs of several Self-Organizing Maps in order to obtain an accurate filling for the missing values. The maps are combined using MultiResponse Sparse Regression and the Hannan-Quinn Information Criterion. The new combination methodology removes the need for any lengthy cross-validati...

متن کامل

Combination of SOMs for Fast Missing Value Imputation

This paper presents a methodology for missing value imputation. The methodology is based on a combination of Self-Organizing Maps (SOM), where combination is achieved by Nonnegative Least Squares algorithm. Instead of a need for validation as when using traditional SOMs, the combination proceeds straight into final model building. Therefore, the methodology has very low computational time. The ...

متن کامل

Application of different inverse methods for combination of vS and vGPR data to estimate porosity and water saturation

Inverse problem is one of the most important problems in geophysics as model parameters can be estimated from the measured data directly using inverse techniques. In this paper, applying different inverse methods on integration of S-wave and GPR velocities are investigated for estimation of porosity and water saturation. A combination of linear and nonlinear inverse problems are solved. Linear ...

متن کامل

APPLICATION OF KRIGING METHOD IN SURROGATE MANAGEMENT FRAMEWORK FOR OPTIMIZATION PROBLEMS

In this paper, Kriging has been chosen as the method for surrogate construction. The basic idea behind Kriging is to use a weighted linear combination of known function values to predict a function value at a place where it is not known. Kriging attempts to determine the best combination of weights in order to minimize the error in the estimated function value. Because the actual function value...

متن کامل

Missing data imputation in multivariable time series data

Multivariate time series data are found in a variety of fields such as bioinformatics, biology, genetics, astronomy, geography and finance. Many time series datasets contain missing data. Multivariate time series missing data imputation is a challenging topic and needs to be carefully considered before learning or predicting time series. Frequent researches have been done on the use of diffe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010